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Dedicated to our late friend and admired mathematician Tom Wolff 



Abstract. We prove uniform L p estimates (Theorem 1.1) for a family of paraproducts 
and corresponding maximal operators. 



1. Introduction 

The purpose of this article is to prove certain uniform estimates on paraproducts. 
Our motivation for this result is that we need it in a sequel || of this paper, where we 
study uniform estimates for multilinear singular integrals as in || with a one parameter 
modulation symmetry. Despite our current narrow objective, the uniform paraproduct 
estimates in the current article may be of independent interest by themselves. 

As motivation, we begin by defining a standard paraproduct form of degree n > 2 to 
be a multilinear form of the type 



„ n 

A(/i, ...,/„) = / ^2Y[QiMx)dx 

</]R ,.^r, ._ 1 



^ fcez i=i 

where the Qi^ are convolution operators 

Qi,kf = f * 4>i,k 

with convolution kernels whose Fourier transforms 0^ are bump functions adapted to 
{£ : |£| < 2 fc } and we assume 

fi, fc (0) = ? a ,*(0) = (2) 

for all k. Here and in the sequel, a bump function adapted to an interval / is a function 
supported in / and satisfying 

U {a) \\ < \i\~ a 



for all derivatives of order < a < N for some large (see JIH 

Classical Calderon-Zygmund theory gives the following standard estimate: 

n 

A(/ 1) ...,/ B )<C7 p ni|/ < ||„ (3) 

i=i 

where 1 < pi < oo are any exponents satisfying the homogeneity condition 

n 

E 1 = 1 - w 

and C p is a constant depending only on the tuple (j>i). 
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To see this estimate we use Holder's inequality to estimate the left hand side of @ by: 

/2 / oo \ 2 

i=l \fc=l / t>2 keZ 

2 

<nw*n»iiii M /<ii» • 

i=l i>2 

Here 5 denotes an appropriate Littlewood-Paley square function operator, and M de- 
notes the Hardy-Littlewood maximal operator. Now the above claim follows easily from 
boundedness of S and M in LP for 1 < p < oo. Observe that the same proof works if we 
replace Qi.ft by Qi,k+ki in (Q) f° r some arbitrary integers fcj. 

It is also well known (||) that it is sufficient to require condition (0) only for <pi k- We 
shall sketch a proof under the more restrictive assumption that vanishes of {£ : |£| < 
2 k ~ L } for some given L > 1. Indeed, in this case we can write for each z > 1 

n+L 

Qi,k ^ ^ Qi,k,l j 

where the symbol of the Fourier multiplier Qi,JM> is adapted to {£ : |£| < 2 k ~ 1 } and 
vanishes on the smaller interval {£ : |^| < 2 k ~ l ~ 2 } except in the case I = n + L when we 
do not require any vanishing. Then we can split A(/i, . . . , /„) as in ([[]) by the distributive 
law into a sum of polynomially in L many multilinear operators, all of which are multiples 
of standard paraproducts after possibly permuting the indices, except for the term 

Ql,kfl(x) ]~[ Qj,k,n+Lfi(x) dx . 
k i>l 

However, this term simply vanishes, because the Fourier transform of 

JJ(/i * <fri,k,n+L) 
i>l 

is supported in {£ : |£| < n2 k ~ n ~ L } whereas the Fourier transform of 



fi * <Pi,k 

vanishes on {£ : |^| < 2 k ~ L }, so the inner product of these two vanishes by Plancherel. 

Observe that in the above argument, the constant in the estimate for A depends on 
L. However, this dependence is artificial as one can show a bound (0) with a constant 
independent of L and independent of any constants fcj as above that we may insert in 
the definition of A. The purpose of this article is to prove a very general version of this 
uniformity result: 

Theorem 1.1. Let n > 2, and let Q G Z n be a set of the form 

£1 = . . . , j n ) : Jla > j t , a + A a for alll<a<K} (5) 

where K > is an integer and i a , i' a e {1, . . . , n}, A a G Z for all 1 < a < K. For each 
1 < i < n and j G Z, let Qj.i be a Fourier multiplier whose symbol is a bump function 
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adapted to {£ : |£| < 2 J } that vanishes at the origin. Then one has the estimates 



E II 



R (ii,-,in)en i=\ 



« llpi 



i=l 



sup 

inez 



n-l 



e n 

(ji,...,j„)eo i=l 



< a 



n-l 
i=l 



l\\Pi 



(6) 



(7) 



for all 1 < Pi < oo obeying (|]) ; where l/p' n + l/p n = 1 and the constant C PjTI! k depends 
on the pi, n, K , but is independent of the Ai and fi. 

The set Q in this theorem is a convex polytope in Z n which is invariant under translation 
in direction (1, . . . , 1). It has essentially K faces. We remark that the number K of 
independent constraints in fl5|) can be bounded by a function of n, so the dependence of 
C P) n.K on K serves only to emphasize that we have better estimates for small K. 

To compare this result further with the existing literature, we observe that we can write 
a paraproduct as in ([]]) or (|6]) in the multiplier form 



„ n 

I m(£ 1 ,...,£ n )T\fi(£i)do- 

JT.£i=0 



'E€i=o 

where the multiplier m satisfies the symbol estimates 

d a m(i) < C\i\-^ (8) 

for any multi index a up to some fixed large order and for all vectors £. Under this 
condition on the multiplier, standard paraproduct theory (e.g. 0-0) gives @. However, 
for the multipliers arising from Theorem [LI] , the constant in (|8[) depends on the set Q. 
Hence the point of the theorem is uniformity in Q. 

The multipliers of the theorem do satisfy a bound of the type 



(napMO<c7]Ii6 



(9) 



with a constant C independent of Q. However, as is shown in 0, this condition is 
in general not sufficient to guarantee an estimate (§). If the multiplier m is of tensor 
product type 



i=l 



in addition to satisfying @ , then the integrand in the definition of A splits into a tensor 
product and one can trivially show (Q) by reducing to the one dimensional case. Observe 
that the multipliers arising in Theorem [Ll| do not split as tensor products because of the 
constraints on Q. However, the idea of the theorem is that the special structure of Q still 
allows to recover enough of the good behaviour of the tensor product case. 
We shall not persue here results for endpoints Pi = oo or p { = BMO. 

It is obtained by induction on the dimension n 



We prove Theorem |LT| in Sections 
of Q. The main tools are the Holder, Littlewood-Paley, and Hardy-Littlewood maximal 
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inequalities, together with some combinatorial manipulations to break f2 up into simpler 
objects. The latter shall be most easily accomplished using the language of graph theory. 

We shall specialize Theorem |TT| to a form which will be convenient for our application 
to multilinear singular integrals with modulation symmetries. 

Corollary 1.2. Let n > 2, and let M 1; . . . , M n be integers. For each 1 < i < n and 
j G Z, let iTj ; i be a Fourier multiplier whose symbol is a bump function adapted to {£ : 
|£| < 2 J+A/i }. Suppose that for each k G Z there exists at least one 1 < i < n such that 
the symbol of ttj^ vanishes at the origin. Then one has the estimate 



E 

j 



/n n 
H*jM < CnnjcllWMn (10) 



for all 1 < pi < oo obeying ^" =1 — = 1, where the constant C v ^ n ^K depends on the p i; n, 
K, but is independent of the Mi and fi . 



We prove this Corollary in Section |5|. It shall follow easily from Theorem |T7T] after some 
normalizations and dyadic decompositions on the ttj^. It is easy to see that some sort of 
cancellation condition on the 7^, such as the vanishing condition given above, is necessary, 
as can be seen by testing ([n]) when the fi are all equal to the same approximation to 
the delta function at the origin. It is important that the absolute values in (|10| ) are not 
inside the integral, as the estimate is easily seen to be false otherwise. 

The first author was partially supported by a Sloan Dissertation Fellowship. The second 
author is a Clay Prize Fellow and is supported by grants from the Sloan and Packard 
Foundations. The third author was partially supported by a Sloan Fellowship and by 
NSF grants DMS 9985572 and DMS 9970469. 

We would like to point out that Tom Wolff's first major result, his simplification of the 
proof of the corona theorem, heavily relied on estimates for paraproducts. 

2. Notation and preliminaries 

It will be convenient to write the paraproducts using the language of graph theory. 

Definition 2.1. Let V be a finite set. A (weighted directed) edge supported in V is a 
triplet e = (vi,v 2 ,A) G V x V x Z. We refer to V\ =: V\{e), v 2 =: v 2 (e), and A(e) as 
the initial vertex, final vertex, and weight of e respectively. A (weighted directed) graph 
G = (V, E) is a finite collection of vertices V and a finite collection E of weighted directed 
edges e supported in V . We call \ V\ and \E\ the order and complexity of G respectively. 

We allow multiple edges from one vertex to another, as well as edges from a vertex to 
itself, although such edges are usually redundant or trivial in our applications. We shall 
henceforth omit the modifiers "weighted directed" in the sequel. 

If p is a directed path in G, we define the weight A{p) of p to be the sum of all the 
weights of the edges in G. If V is a subset of G, we define G\v> to be the graph with 
vertex set V and edges {e G E : t>i(e),f 2 (e) G V'}. We write G — V for G\v\y- 

If E' is a set of edges whose vertices are in V, we write GUE' for (V, EUE') and G — E' 
for (V, E — E'). 

If G = (V,E) is a graph and Vi, V 2 are disjoint subsets of G, we say that Vi, V 2 are 
adjacent if there exists an edge in G with one vertex in V\ and the other in V 2 . 
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Every graph can be associated with a polytope: 

Definition 2.2. Let G = (V,E). We define the polytope Q(G) C Z v of G to be the set 

ft(G) := {(j v )vev e Z v : j^e) > j U2(e) + A(e) for all e G E}. 

We shall often write j for {j v )v<=y- 

Note that Vt{G) is empty if G contains directed circuits of positive weight. (This 
implication can be reversed, but we shall not need this here). Conversely, if E is empty, 
then Q(G) = Z V . 

Definition 2.3. Ife = (v\, t>2, A) is an edge, we define the duale* of e by e* := (t>2, v\, —A+ 
I)- 

We observe the basic relationship 

n(V, E) = n(V, E U {e}) tt) Q(V, E U {e*}) (11) 
whenever (V, E) is a graph and e is an edge with vertices in V, and t±J denotes disjoint 



union. The identity ([TlD allows us to replace an edge e in a graph by its dual, modulo 
objects of lesser complexity. 

Theorem |1.1| is concerned with a certain graph of order n. We shall fix this n: in what 
follows all graphs shall have order less than n. Observe that while we do not exclude 
multiple edges, in what follows we can easily reduce graphs to not contain multiple edges, 
and thus the complexity of all graphs can be assumed to be less than (n + l) 2 - For each 
v G V and j G Z, we fix Q v - to be a Fourier multiplier whose symbol is a bump function 
adapted to {£ : 2 10nj < |f | < 2 10nj+1 }. We use powers of 2 10n rather than 2 to obtain 
better lacunary separation properties. 

We shall need two types of paraproducts: non-maximal paraproducts, which we de- 
fine as multi-linear forms; and maximal paraproducts, which we define as multi-linear 
operators. 

Definition 2.4. Let G = {V,E) be a graph with \V\ < n, and suppose that f = (/ c )„ e y 
is a family of test functions on R. We define the non-maximal paraproduct Aq to be the 
quantity 



Mf) ■■= £ / 119*./ 

7^o.(n\ veV 



We define the order and complexity of Ac to be the order and complexity of G respectively. 

Definition 2.5. Let G — (V tfcl {i/*},^) be a graph with a distinguished vertex t>* and 
\V\ < n, and suppose f = (f v )vev is a family of test functions on R. We define the 
maximal paraproduct A^*' to be the function 

aW(/) = bup| Yl UQW- 

J * e Jen(ay.j v *=j*vev 
We define the order and complexity of A^ to be \V\ + \ and \E\ respectively. 
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Note that a maximal paraproduct on n functions is slightly higher order than a non- 
maximal paraproduct on n functions, but slightly lower order than a non-maximal para- 
product on n + 1 functions. From ([11]) we observe the identity 

A G (/5 = A Gu{e} (/) + W-}(/5 (12) 

and the triangle inequalities 

AS^/I^A&Wm+A&W^/) (13) 

and 

Agu ] {e} (/)<Afr ] (/)+Ag^ } (/) (14) 

whenever e is an edge with vertices in the vertex set of G. 
We shall prove 

— * 

Theorem 2.6. • Let G = (V, E) be a non-empty graph with \V\ <n, and f = (f v ) ve v 
a family of test functions. Then we have 

|A G (/)i<C w , n nilMU (15) 

whenever 1 < p v < oo and Yuvev ^-IVv — 1- 

• Let G — (V l±l {w*}, -E) fre a non-empty graph with \ V\ < n, and f = {f v ) v ^v a family 
of test functions. Then we have 

iiAg* ] (/)iu<^, W)ri nii/«iu as) 

vev 

whenever 1 < p*,p v < oo and X^ev V?*u = Vp*- 

Theorem LI then follows easily from rescaled versions of (|l"5|) and a dyadic decompo- 
sition of the multipliers Qkj- 

The theorem is easily verified from the standard linear theory of maximal truncated 
singular integrals (see e.g. |H|) when the order of the paraproduct is l| (the lowest order 
in which the statement is not void). Now suppose the order is at least two and let n = \V\ 
be the number of functions. We shall divide into two cases. 

Definition 2.7. • We say that Aq (resp. Aq ) has a dominant frequency vq G V if 
one has 

Jvq ^ 3v 

whenever j e Q(G) and v G V\{i>o}. 

• We say that Aq (resp. Aq) has two competing frequencies Vx,v% £ V if V\ ^ t>2 and 

whenever j e Q(G). A paraproduct with two competing frequencies v±,V2 is in stan- 
dard form if {f 2 } is not adjacent to V\{vi,V2} (resp. {t>*} U V\{vi, V2}). 

Note that the frequency j Vt plays no role in determining whether a maximal paraproduct 
has a dominant frequency or two competing frequencies. 

From (|i~2|), (|l"3|) and the introduction of edges of the form (v,v',0) or their duals, we 
observe that any paraproduct can be estimated by the sum of at most C n paraproducts 
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of the same order (each with complexity increased by at most C n ), such that each such 
paraproduct either has a dominant frequency or two competing frequency. Note that any 
paraproduct with two competing frequencies can be placed in standard form, by replacing 
any edge with vertex v 2 with the corresponding edge with vertex f 1; and then adding the 
edges (vi,v 2 ,0) and (v 2 ,v 1 ,0). 

If a non-maximal paraproduct Aq has a dominant frequency vq, then the summands 
have frequency approximately 2 l0n ^ v o, and so Aq vanishes identically. 

Now suppose a non- maximal paraproduct Ac has two competing frequencies v±, v 2 
and is in standard form. Assuming first V \ {vi,v 2 } 7^ we can use Cauchy-Schwarz to 
estimate 

1 j2 n^i^^c/^c/^^pi e n <&m 

j £ n(G)^y J * jen(G):j vl =j V2 =j,vev\{v u v 2 } 

where S v (f v ) is the Littlewood-Paley square function 

SM) :=(El^| 2 ) 1/2 ' 

k 

From this and Holder's inequality, and the boundedness of S v on L p , 1 < p < 00 we have 

^a((fv)vev) < Cpi ,p 2 1 1 1 1 pi 1 1 /u 2 1 1 Pa 1 1 ^G|v X { „ 2 } ( { } ) 1 1 ( 17 ) 

whenever 1 < Pi,p 2 ,P* < 00 are such that \jp\ + l/p 2 + 1/p* = 1. If V \ {^1,^2} = 0, 
then the above calculation holds without the factor involving the maximal paraproduct, 
and we immediately conclude Theorem |2.6| . 

From the above remarks we see that the estimate (|I5|) for non-maximal paraproducts 
automatically follows from the estimate ([16]) for maximal paraproducts of smaller order. 
Thus it remains to consider maximal paraproducts. 

To estimate maximal paraproducts we require some further definitions. 

Definition 2.8. • A maximal paraproduct A^H is separable if one can write V = 
V\ l±l V 2 where V\ and V 2 are non-empty and not adjacent. 

• A maximal paraproduct A^*' with dominant frequency v e V is said to be semi-direct 
if { v *} an d V\{ v o} are n °t adjacent. 

• A maximal paraproduct A^*' with two competing frequencies Vi,v 2 & V is said to be 
semi-direct if {v*} and V\{vi,v 2 } are not adjacent. 

• A maximal paraproduct is said to be good if it is of one of the above three types. 

We now show that the estimate (0) for good maximal paraproducts follows from an 
application of Theorem 12.61 applied to paraproducts of strictly lower order. 

\v 1 

If A G * is separable, then we have the pointwise estimate 

Ag* ] ((AW) < A^ u{ ^ } ((/,)^JAg|^ } ((^)^ 2 ). (18) 

The claim then follows from Holder. 

Now suppose A^H is semi-direct with dominant frequency vq. We shall assume first 
V \ {vo} is not empty. In this case we observe the identity 

e n<&/.= e e n<&/. 
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for all j*. We may of course assume that fl(G\[ Vt , iVo y) is non-empty. From the dom- 
inant frequency hypothesis we thus have j VQ > j v whenever j G Q(G — {f*}) and 
v G V^jfo}. 111 particular, the inner summand has Fourier transform supported on 
the annulus {2 10n ^- 3n < |f | < 2 10n i° +3n }. Since {j : (j,, j ) G Q(G\ {v , M )} is an interval 
(possibly infinite) for each v*, we may omit the constraint on j and replace it by an ap- 
propriate Fourier multiplier cutting of the very high and very low frequencies (not needed 
if the interval is infinite on either side). This Fourier multiplier can be estimated by the 
Hardy Littlewood maximal function. Thus we have the pointwise estimate 

i E Uq>m^ cm \E E II9*./«i 

jen(G):j v „=j* «ev ^ eZ jen(G-{v t }y.j V0 =j vev 

where M is the Hardy-Littlewood maximal operator. Taking suprema in and applying 
the Hardy-Littlewood maximal and Littlewood-Paley inequalities, we obtain 

iiAg* ] (Cf,w)iu < cikE i E n^i^i 2 ) 1/2 iu- 

ioez jen(G-{v*}y.j VQ =j vev 
The expression inside the norm on the right-hand side is pointwise bounded by 

Sv {fv )-^Q°}^j{{fv)veV\{v })- 

The claim then follows by Holder. The case V \ {vq} = is again an easy variant of the 
above without a smaller order maximal paraproduct. 

Finally, suppose A^*' is semi-direct with two competing frequencies vi,v 2 G V. We 
may place this paraproduct in standard form without affecting the property of being 
semi-direct. But we may then repeat the arguments used to prove QI7|), and obtain the 
pointwise estimate 

({fv}veV\{vi,va}), 

again with no maximal paraproduct on the right hand side if V \ {fijfa} is empty. The 
claim then follows from Holder and the boundedness of the Littlewood-Paley square func- 
tion on L p , 1 < p < oo. 

In light of the preceding discussion and induction, Theorem ^| (and hence Theorem 
|l.ip will follow if we can show 

Proposition 2.9. Every maximal paraproduct A^*^ can be bounded by a finite collection 
of good maximal paraproducts of the same order. The cardinality of the collection is 
bounded by a quantity depending only on the order and complexity of A^H . 

This will be done in the next two sections. The basic idea shall be to exploit variants 
of the identity 

Xa<6,a<c X.a<b<c Xa<c<b 

as well as fll3|) , ([14]) to permute the edges of the graph G into a good form. 
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3. The case of two competing frequencies 

It remains only to prove Proposition |2ll By the remarks in the previous section it 
suffices to verify the Proposition when has a dominant frequency or two competing 
frequencies. 

In this section we consider the case of two competing frequencies V\, v 2 ] this case is less 
technical than the dominant frequency case. We may place the maximal paraproduct in 
standard form. The vertex v 2 now plays no significant role, and we shall often work with 
the reduced graph G — {v 2 }. Let E — {v 2 } denote the edge set of G — {v 2 }. 

Write G = (VU {f*}, E). We shall induct on the complexity of G. Suppose inductively 
that the claim is proven for all maximal paraproducts AifcL , E ,-, with two competing 
frequencies Vi,v 2 in normal form, with \E'\ < \E\. 

In light of ( fn| ) we may replace any edge em. E — {v 2 } by its dual e* modulo objects of 
lower complexity, without disturbing the property of having two competing frequencies in 
standard form. Thus we may freely reverse the direction of edges in G — {v 2 } (adjusting 
the weights accordingly). 

We may assume that G — {v 2 } is connected as an undirected graph, for if G — {v 2 } split 
into two disconnected components then A^H is clearly good. 

We may also assume that G — {v 2 } contains no circuits as an undirected graph. To see 
this, we suppose for contradiction that G — {v 2 } contained an undirected circuit. By the 
freedom to reverse the direction of edges in G — {v 2 } we may assume that this circuit is 
directed, and has positive weight. But then A^H is identically zero, and we are done. 

From the preceding discussion we see that we may assume that G — {v 2 } is a tree (when 
viewed as an undirected graph), with no loops or double edges. We may then assume that 
the vertex v* is a leaf node (i.e. has degree exactly one), since A^*' becomes separable 
otherwise. 

We now induct on the distance from u„ to v\ in G—{v 2 } (viewed as an undirected graph). 
If this distance is 1 then A^H is semi-direct and we are done. Now suppose inductively 
that the distance is greater than 1. Then v* is adjacent to some vertex v' G V — {v 1, v 2 }, 
which is in turn adjacent to some vertex v " G V — {v 2 }, such that v" is closer to v\ than 
v' in G — {v 2 } (viewed as an undirected graph). 

By reversing edges if necessary we may assume that G contains the edges e\ = (v*, v', A{) 
and e 2 = (v',v",A 2 ). Observe that the statement 

jv* > 3v> + M and j Vf > j v » + A 1 + A 2 

holds if and only if one of the (exclusive) statements 

jv, > jv> + At and j v > > j> + A 

or 

jv, > jv" + A x + A 2 and j v » > j v < - A 2 + 1 
hold. From this and the triangle inequality we have 

Ag* ] (7) < Ag*2 {e2}u{e3} (/) + Ag!! {eije2}u{eSie3} (/) (19) 
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where e% := (v*,v",Ai + A%). The first term on the right-hand side is separable, while 
the second term on the right-hand side can be treated by the induction hypothesis. This 
completes the inductive argument. 

4. The case of a dominant frequency 



To conclude the proof of Proposition |2.9| and hence Theorem |1.1| it remains only to 
consider the case when G has a dominant frequency Vq. In this case we may freely replace 
G by G U H, where H is the graph with vertex set V U {v*} and edge set 

{(v ,v,l) :veV-{v }}, 

since this makes no difference to the maximal paraproduct AjiH other than to increase 
the complexity slightly. 

Fix V, vq, Vnt. The graph H is an example of a heirachy, which we now define. 

Definition 4.1. A heirarchy is a graph H — (V U {v *}, E') obeying the following proper- 
ties. 

• No edge of E' has v* as a vertex. 

• For every v G V — {vq} there exists a unique directed path p(v) = (ei, e<i, • • ■ ,e r ) of 
edges in E' from v to v. We refer to r, e r , and A(p(v)) as the ranking, link, and 
depth of v respectively, and define the superior s(v) of v to be the initial vertex of e r . 
We define vq to have ranking and depth, and no link or superior. 

• We have A(p(v)) > for all v G V — {v }. 

Note that if if is a heirarchy and G is any graph onFU {f*}, then A^' H automatically 
has vq as a dominant frequency. Indeed one has 

jv < jv — A{p{v)) (20) 

for all v G V and j G Q(H). Also observe that H — {v*} is necessarily a rooted tree with 
root Vq. 

Proposition [2.9| now clearly follows from 



Proposition 4.2. Let G be a graph on VU{v*} and let H be a heirarchy. Then A^jj^ can 
be bounded by finitely many good maximal paraproducts. The number of such paraproducts 
depends only on the complexity and order of G. 

Proof As before, we induct on the complexity of G. We may assume that the proposition 
is true for all graphs G of lower complexity and all heirarchies H. The inductive hypothesis 
is allowed to be vacuous when the complexity of G is zero. 

By the arguments in the preceding section, we may freely reverse edges in G (but not 
in H\), and assume that G contains no circuits as an undirected graph. 

We may assume that G does not contain any edge which has the same initial and final 
vertices as one in H, since the edge from G is either redundant or can be used to replace 
the edge in H, and in either case we can reduce the complexity of G. 

We divide into two cases, depending on whether G — {v*} is non-empty or not. 

Case 1. Suppose that 67 — {v*} is non-empty. Then G contains an edge e = (v, v', A ) 
for some v, v' G V. By replacing e with its dual if necessary we may assume that 

AQ + A(p{v))>A(p(v')) . (21) 
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If v' G p(v) (i.e., the path p(v) crosses the vertex v',) then the condition ( p]) and the 
fact that if is a heirarchy forces G U H to contain a directed circuit of positive weight, so 
that A^y H vanishes. Hence we may assume that v' $ p(v). In particular we have v' ^ v . 

Let ex be the link of v'. From (|HD we have 

A^ 1 ( f) < A [v * ] ( f) + A [v * ] ( f) 

The graph G U H — {e{\ can be written as the union of G — {e} and H — {e{} U {e}. 



Since if is a heirarchy, one can verify using (EI) that H — {ei} U {e} is also a heirarchy 



(in fact, the depth of any vertex either stays constant or increases). Thus this summand 
is acceptable by the induction hypothesis. 

Now consider the contribution of G U H — {ex} U {e*}. If the superior s(v') of v' is equal 
to vq then one can see using ( pTT ) that this graph contains a directed circuit of positive 
weight, so the contribution of this graph vanishes. Now suppose s(v') ^ v . Let e 2 be the 
link of s(v'). 

Observe from (p0|) , that the edge e 2 is redundant in G U H — {ei} U {el}: 



Q(G UH-{ ei }U {e\}) = n(G UH-{e u e 2 } U {e*}). 

The graph G U H — {e\,e 2 } U {e^} can be written as the union of G — {e} and H — 
{ei, e 2 } U {e, e*}. The latter graph can be verified to be a heirarchy using (pip by similar 
arguments as before, and so this case is also treatable by the induction hypothesis. This 
concludes the treatment of Case 1. 

Case 2. Suppose that G — {v*} is empty. Define the total ranking to be the sum of 
rankings of all the vertices in V which are adjacent to v*. 

We induct on the total ranking. If the total ranking is zero then A^Jj H is semi-direct 
with dominant frequency Vq. Now suppose the total ranking is positive. Then is 
adjacent to at least one vertex v in V — {vq}. By reversing edges if necessary, we may 
assume that G contains an edge e of the form e = (t>*, v, A ). 

Let e% = (s(v),v, A\) be the link of v, and let e 2 denote the edge e 2 := (u*, s(v), Aq—Ai). 
By repeating the derivation of ([19]) we have the triangle inequality 

GUH\J I — il G-{e)U{e 2 )UffwJ ^ XV GUH-{ei}U{e%}\J J " 

The first paraproduct on the right-hand side is of the same form as G U H, but has the 
total ranking reduced by one, and so can be handled by the induction hypothesis. Since 
H is a tree, the second paraproduct is separable. This concludes the treatment of Case 
2. ■ 



Proposition ^9] has now been proven in all cases. The proof of Theorem [0| is now 
complete. 



5. Proof of Corollary 1.2 



Fix n, Mi, . . . , M n , itjj as in Corollary [L2|. Let m^j denote the symbol of ~Kj^. By the 
hypotheses of the Corollary, pigeonholing, and symmetry we may assume that mj >n (0) = 
for all j G Z. By shifting the summation index we may assume M n = 0. 
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We may replace the left-hand side of (|T0|) by 



J2 e i / II n 3,ifi 



i=i 

where 6j are bounded constants. By absorbing the ej into vn^ n (for instance) we may 
assume that e 3 - = 1. 

Following [|ll[| , we now renormalize the in a form which allows a good decomposition 
as a telescopic series. For each integer j, let 0^ be a bump function adapted to [— 2 J , 2 J ] 
which satisfies 0j(O) = 1. 

For any 1 < % < n, we decompose 

m^i = ni :,({))<:):. u + (m jti - m^O)^^). 

Thus by decomposition, we may assume that 

rrij t i is a bounded multiple of 4>j+Mi or vanishes at the origin. 

Of course when % = n the latter possibility obtains. Whenever the case m^O) <pj + M t 
obtains, we may replace this function by ^j+Mi an d replace m Jin by m(j, i)(0)m Jjn . Thus 
we may assume 

m^i is equal to 4>j+Mi or vanishes at the origin. 

We set ] n = j. If nij ti vanishes at the origin, we set ji = j n + Mj. If rrij^ = <fij+Mv we 
consider the lacunary decomposition 

fa+Mi = Yl fai - hi- 1 ■ 

ji<jn+Mi 



The claim then follows by an application of Theorem 1.1 
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